Automated Detection of Spammers’ Profiles using Improved K-Means in Twitter

نویسندگان

  • Sanjeev Dhawan
  • Kulvinder Singh
چکیده

In the digital world of applications, a new application called twitter made a major impact in online social networking and micro blogging. The communication between users is through text based post. Its popularity also attracts many spammers to infiltrate legitimate users account with large amount of spam messages .Online social networking platforms are providing us with a large scale platform to study the human behavior. This paper improves K Means algorithm to separate human from not human users in order to identify normal human activity. K-Means is one of the simplest unsupervised learning algorithms that solve the well known clustering problem. Here in this research paper in order to achieve the target results with better accuracy, an efficient approach will be designed by modifying sequential K-Means clustering algorithm to detect spam in Twitter. The data which has been provided for the entire process to be performed is extracted from the social networking website Twitter with the help of R Package as it provides interface with the Twitter web API (Application Programmable Interface). Various calculations has been performed to calculate the accuracy, precision, and recall and based on these results and respective graphs have been obtained.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Yang, Harkreader and Gu: Empirical Evaluation and New Design for Fighting Evolving Twitter Spammers

To date, as one of the most popular Online Social Networks (OSNs), Twitter is paying its dues as more and more spammers set their sights on this microblogging site. Twitter spammers can achieve their malicious goals such as sending spam, spreading malware, hosting botnet command and control (C&C) channels, and launching other underground illicit activities. Due to the significance and indispens...

متن کامل

Die Free or Live Hard? Empirical Evaluation and New Design for Fighting Evolving Twitter Spammers

To date, as one of the most popular Online Social Networks (OSNs), Twitter is paying its dues as more and more spammers set their sights on this microblogging site. The Twitter spammers can achieve their malicious goals such as sending spam, spreading malware, hosting botnet command and control (C&C) channels, and launching other underground illicit activities. Due to the significance and indis...

متن کامل

An analysis of 14 Million tweets on hashtag-oriented spamming

Over the years, Twitter has become a popular platform for information dissemination and information gathering. However, the popularity of Twitter has attracted not only legitimate users but also spammers who exploit social graphs, popular keywords, and hashtags for malicious purposes. In this paper, we present a detailed analysis of the HSpam14 dataset, which contains 14 million tweets with spa...

متن کامل

Spammers Are Becoming "Smarter" on Twitter

T witter has become one of the most commonly used communication tools in daily life. With 500 million users, Twitter now generates more than 500 million tweets per day. However, its popularity has also attracted spamming. Spammers spread many intensive tweets, which can lure legitimate users to commercial or malicious sites containing malware downloads, phishing, drug sales, scams, and more.1 S...

متن کامل

Spam Filtering in Twitter Using Sender-Receiver Relationship

Twitter is one of the most visited sites in these days. Twitter spam, however, is constantly increasing. Since Twitter spam is different from traditional spam such as email and blog spam, conventional spam filtering methods are inappropriate to detect it. Thus, many researchers have proposed schemes to detect spammers in Twitter. These schemes are based on the features of spam accounts such as ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016